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(54) Low-power-consumption viterbi decoder 



(57) A Viterbi decoder for Viterbi-decoding an input 
signal, includes a path memory, a shift register, and a 
traceback circuit. The shift register has at least (con- 
straint length -1) bits as the number of stages. The 



traceback circuit inputs, to the shift register, the AND 
per bit between a signal generated by a decoder con- 
nected to the shift register and the content of the path 
memory designated by a traceback address counter. 
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Description 

[0001 ] The present invention relates to a Viterbi decoder having a very-low-power-consumption Viterbi decoding LSI 
and, more particularly, to a portable telephone Viterbi decoder using a CDMA system for performing Viterbi decoding 

5 even during a standly mode. 

[0002] Viterbi decoders for convolution codes have conventionally been used for satellite communication and the like 
as an error correction system having a very high random error correction capability. This system requires a very high 
operation speed and a large frame length. A Viterbi decoding LSI must perform memory truncation of truncating path 
memories at an appropriate stage. Fig. 1 shows an example of path memories and a maximum likelihood decision cir- 

10 cuit in a Viterbi decoding LSI which satisfies the requirement. As shown in Fig. 1, after the switches are switched 
between the upper and lower contracts by a path select signal Ai+1 0. the contents of shift registers are shifted between 
rows and sequentially repeatedly updated at a time. The maximum likelihood decision circuit detects a path having a 
maximum path metric at time The switch is changed over to a corresponding shift register to read its contents, 
which serve as a maximum likelihood decoded output To implement this by one chip, however, many gates must be 

is operated at a high speed, resulting in very high power consumption. 

[0003] In recent years, Viterbi decoding is being used in a digital portable telephone system and a portable telephone 
system based on CDMA. These systems use a low operation speed and a relatively small frame length, unlike satellite 
communication. 

[0004] In these systems, the power consumption must be low because their power supply is a battery for portability. 

20 As a current low-power-consumption Viterbi decoder, one having a dedicated Viterbi decision circuit incorporated in a 
digital portable telephone system DSP has been reported (see NIKKEI ELECTRONICS, No. 620, 1994, pp. 15 - 16). 
[0005] To prolong the telephone conversation time and reduce the battery capacity for enhanced portability, the power 
consumption must be further reduced. Particularly in the CDMA system for performing Viterbi decoding even during a 
standly mode, reduction in power consumption is inevitable. 

25 [0006] Of current low-power-consumption Viterbi decoders, one having a dedicated Viterbi decision circuit incorpo- 
rated in a digital portable telephone system DSP can execute two ACS operations within six cycles. However, detection 
of path memories and maximum likelihood decision necessary for Viterbi decoding are performed by software. In addi- 
tion, metric information required for ACS operation is frequently externally accessed. Particularly in soft decision, the 
number of metric information bits increases to increase the overhead. This makes it difficult to reduce the power con- 

30 sumption. . 

[0007] The CDMA system exploits a RAKE receiver. A signal decoded by phase estimation and amplitude estimation 
at each finger has a squared amplitude for both I and Q signals. In attaining Viterbi decoding robust against level vari- 
ations such as Rayleigh fading, a metric serving as soft decision information is preferably squared. For this reason, soft 
decision information has a double bit width. This further increases the overhead in the conventional decoder. 

35 [0008] The present invention has been made in consideration of the above situation, and has as its object to provide 
a small-size, light-weight Viterbi decoder capable of reducing the power consumption and improving the operability. 
[0009] To achieve the above object, according to the first aspect of the present invention, there is provided a Viterbi 
decoder for Viterbi<Jecoding an input signal, comprising a path memory, a shift register having at least (constraint 
length -1) bits as the number of stages, and a traceback circuit for inputting, to the shift register, an AND per bit between 

40 a signal generated by a decoder connected to the shift register and a content of the path memory designated by a trace- 
back address counter. 

[0010] According to the second aspect of the present invention, there is provided a Viterbi decoder compnsing a 
traceback circuit including a path memory constituted by a first RAM, an ACS circuit including a path metric memory 
constituted by a second RAM, and a branch metric circuit, wherein the traceback circuit including the path memory has 
45 a shift register having at least (constraint length -1) bits as the number of stages, a signal generated by a decoder con- 
nected to the shift register and a content of the path memory designated by a traceback address counter are ANDed in 
units of bits, and the AND is input to the shift register. 

[001 1] According to the third aspect of the present invention, there is provided a Viterbi decoder wherein the path 
memory constituted by the first RAM defined in the second aspect is divided into a plurality of blocks, at least one upper 
so bit of the shift register forms a chip select signal, a divided path memory is selected by the chip select signal, a decoding 
result of remaining bits of the shift register and a content of the path memory selected by the traceback counter and the 
chip select signal are ANDed in units of bits, and the AND is input to the shift register. 

[001 2] According to the fourth aspect of the present invention, there is provided a Viterbi decoder wherein the path 
metric memory constituted by the second RAM defined in the second aspect is divided into a plurality of blocks, each 
55 of the divided path metric memories comprises an ACS circuit, and sums corresponding to surviving paths selected by 
the ACS circuits are written in units of the divided memory groups in storing the sums as maximum likelihood of a state 
in the path metric memory. 

[001 3] According to the fifth aspect of the present invention, there is provided a Viterbi decoder further comprising an 
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ACS circuit and a traceback circuit which start operating by an ACS start signal and a traceback start signal, the ACS 
circuit and the traceback circuit being event-driven by the respective start signals. 

[0014] * According to the sixth aspect of the present invention, there is provided a Viterbi decoder wherein when path 
memory stages in the traceback circuit defined in the second aspect are truncated by an appropriate length corre- 

£ sponding to a plurality of blocks, a truncation range overlaps preceding and succeeding truncation ranges in units of 
blocks, a traceback start position is set by a maximum likelihood decision circuit, and a start block of one surviving path 
in a truncation symbol corresponding to a state selected by traceback is used as decoded data. 
[0015] According to the seventh aspect of the present invention, there is provided a Viterbi decoder wherein the path 
memory divided into the plurality of blocks defined in the sixth aspect is a ring buffer rotating in units of blocks, traceback 

io starts from last data in a last block, and data corresponding to a start block is used as decoded data. 

[0016] According to the eighth aspect of the present invention, there is provided a Viterbi decoder wherein a ring 
buffer formed by the path memory divided into the plurality of blocks defined in the sixth aspect is a RAM, and data 
update in units of blocks is realized by address control. 

[00171 According to the ninth aspect of the present invention, there is provided a Viterbi decoder wherein input data 
75 corresponding to each of the blocks in the sixth aspect is time-compressed in units of blocks, and ACS processing and 
traceback are performed at a compression timing. 

[0018] According to the first aspect, a small-size, light-weight Viterbi decoder capable of reducing the power con- 
sumption and improving the operability can be provided. 

[0019] According to the second aspect, an optimum hardware architecture can be realized with a minimum number 
20 of optimized components, compared to a combination of a DSP and a Viterbi accelerator as a conventional architecture. 
Therefore, a small-size, light-weight Viterbi decoder capable of reducing the power consumption and improving the 
operability can be provided. 

[0020] According to the third aspect, a path memory is divided into a plurality of blocks, and each divided path memory 
is selected by a chip select signal. Accordingly, a lower-power-consumption Viterbi decoder can be provided. 
25 [0021] According to the fourth aspect, since a path metric RAM is divided and comprises an ACS circuit for each 
divided metric, parallel processing can increase the speed. Also, traceback is realized by the above hardware architec- 
ture, and a high-speed Viterbi decoder can be provided. 

[0022] The Viterbi decoder of the present invention defined in the fifth aspect is of the type event-driven by ACS and 
traceback start signals. The Viterbi decoder can be released from limitations imposed by operation clocks and flexibly 
30 designed. 

[0023] According to the sixth aspect, even rf a signal has a long frame, traceback need not be frequently performed 
because path memories can be truncated in units of blocks. Therefore, a high-power-efficiency Viterbi decoder can be 
provided. 

[0024] According to the seventh and eighth aspects, even when the truncation range overlaps preceding and suc- 
35 ceeding truncation ranges upon memory truncation, a Viterbi decoder whose memory capacity is decreased by con- 
structing path memories in a ring buffer shape can be provided. 

[0025] According to the ninth aspect, since input data is time-compressed and processed in units of blocks, an event- 
driven Viterbi decoder capable of smooth traceback in units of blocks can be provided. 

[0026] The above and many other objects, features and advantages of the present invention will become manifest to 
40 those skilled in the art upon making reference to the following detailed description and accompanying drawings in which 
preferred embodiments incorporating the principles of the present invention are shown by way of illustrative example. 

Fig. 1 is a block diagram showing an example of the arrangement of a conventional Viterbi decoder; 
Fig. 2. is a block diagram showing an example of the arrangement of an embodiment of the present invention; 
45 Fig. 3 is a block diagram showing an example of an ACS circuit of the present invention including a path metric 
memory built by a RAM; 

Fig. 4 is a block diagram showing an example of a branch metric generation circuit of the present invention; 
Fig. 5 is an explanatory view showing the operation of the present invention when M stages are parallel -processed; 
Fig. 6 is a block diagram showing an example of an event-driven architecture according to another embodiment of 
so the present invention; and 

Figs. 7A and 7B are explanatory views, respectively, showing an example of path memory block truncation and the 
traceback start memory address position according to still another embodiment of the present invention. 

[0027] Several preferred embodiments of the present invention will be described below with reference to the accom- 
55 panying drawings. 

[0028] Fig. 2 is a block diagram showing the arrangement of a traceback circuit which uses a Viterbi decoder as the 
first embodiment of the present invention and includes path memories. This traceback circuit comprises a shift register 
8 having (constraint length -1) bits, counters 1 1 and 7 each having a count value corresponding to the frame length, an 
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SIP converter 2 and a latch circuit 3 respectively for converting decision results sent from ar .ACS circuit (not shown) 

fnfo paS dal arc, for storing the data in path memories 1-1 , 1-2 1-32. a decoder 9 for Reading the upper £ts 

oVdata in the shift register 8 and generating 32 chip select signals for the path memory banks 1-1 ^"^-J 
decker 5 for generalg comparison data for selecting necessary bits from data obtained from chip-selert^ _path 
fi memories Z AND circuTt 4 for ANDing the selected bits and the comparison data ,n into of brts. and selectors 6 nd 
5 ToTs?ecTng a state wherein the ACS operates to store comparison data in a path memory, or a state wherein data 

SwT^gTdeSio^ data sentfrom the ACS circuit are seriaVparallel-converted by the S/P converter 2. and the 

selection, the decision data stored in the latch circuit 3 are sequentially stored in the path memories 1-1.1 -2 1 32 

SSi?" Aft^eSme data are stored, the state selectors 6 and 10 are swrtched to extract data by traceback. The 
W*^™c£Z 7 b set in the counter 11 (setting of the last count value can be omrtted by reptaang the 
t * counters 1 1 and 7 with one up-down counter), which then counts down. 

SiSl Thelast Potion of the received signal is tail brts corresponding to 8 x 3 = 24 symbols. This | portior *«™*«* 
SSLJSSS^i All "0"s are set in the shift register 8. path memory address "1-1 " . •^J^J?? 
Se d Jer 9. and the selected data are output onto a bus 12 to the AND circuity The data are ANDed with the 
c^ate obtainSby the decoder 5 in units of bits. In this case, since all "0"s are stored ,n the shrft register 8. the f^t ^t of 
tiS p^eCyTs checked. The contents detected by the AND circuit 4 are fed a 0 " 
these contents the same processing is repeatedly performed to trace back the most probable path. 
Si ^ ^ i^a bfock digram shewing an example of the arrangement of an ACS circuit wh,ch sends deas,on data 
to the embodiment shown in Fig. 2 and includes a path metric memory built by a RAM. 

Su341 ^ T^pati7mS memory includes two memories101 arxf 105. one of which serves as an input SKles while the 
Kser^es as J o^ s^The input and output sides are switched after all states are compared. Switching ,s 

n the input^ath memory, the least significant bit of the counter is connected to the ™st s.gnrf.can brt o f the add ess. 
n he ou^ufpath memory, the least significant bit of the counter is connected to tm > least ° f 
For exarrple. state No. 0 is compared with No. 128. and the companson results are stored .n brt Nos. 0 and 1 of the 

?0^S^Tafa y tenporari.y stored in registers 1 13 and 1 10 are added by branch metric adders 1 16 and 107. the 
sunls fr^^^ 1 1 6 and 1 07 are compared by a comparison adder 1 1 7 to determme the smaller 

o^rcrrSrison «su.ts are sent to the traceba* drcurt including the above^iescribed path memories The com- 
as narison reitear^ to select a metric. A state having high maximum likelihood is selected by a selector 106. 
^^^JSTto the output path metric memory. Registers 1 1 2 and 1 1 1 read next data ,n advance ,n con- 

tOO^Tr^sTbS metric generation circuit for sending a branch metric to an example of the ACS circuit 

«. ioSst "2 ISSSSSS^I seria, data sent from a deintertaave unrt (not shown) into 3- 
Spirieldata Fo^ 

^^tZerl % 1 and 209 generate a branch metric by adding data selected by selectors 206 to 208 which operate 
Z eceX of I sr 9 na, from a generating polynomial circuit 205. Since the adders 21 1 and 209 have 
45 elating polynomial asLt on the source side, they generate selected branch metrics in «°^^^Sr 
ation of the path metric circuit. The timings of the adders 21 1 and 209 are synchronized wrth ACS using the counter 
value in Fig 3 A twos complement circuit 210 generates a paired branch metnc used for ACS. 
lowoi Fig 5 is a view for explaining high-speed ACS operation when path metric memones are divided into M branch 
stages, and the M stages are parallel-processed. /-Hdroca 
so [0041] In Fig 5 memory banks implement modulo M operation wrth respect to each state (address). 

0042 TOs wi. be explained separately for the input and output metric sides. The input s,de address pair are deter_ 
minSby a modulo option in accordance with the bank number. In Fig. 5, as the state 

number Ureases in modulo in the upper half of memories but the bank number decreases in modulo in the lowerhalf. 

is the first memory bank group, the lower half is the second memory bank group. The M/2 ACS pairs can be paralle. 
processed without any conflict. 
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[0044J The output side address pair are an address twice the input side address, and this address plus one. Address 
banks are determined by a modM operation. Assuming that the upper half address of the input source addresses is 
adrUH, one output side address is (adrUH x 2)modM + (0, 1). The upper-half memory bank group has a modulo address 
of 0 to (My2)-1 , and the lower half has (M/2) to M-1 (for e x M = 4, 0,1 and 2,3). 

[0045J For example, in the upper half, the real address is qM + {0 to (M/2)-1). One output side address is twice this 
address, and thus 2qM + (0 to M-2)even. modM is eliminated from this to obtain (0 to M-2)even for the memory bank. 
The other address is the one address plue one. Therefore, memory banks having the other type of addresses interleave 
between memory banks having one type of addresses. In this manner, the output path memory can be determined with- 
out any conflict, and the M/2 ACS pairs can be parallel-processed. 

(0046] Similarly, in the lower half, the real address is qM to {(M/2) to M-1}. One output destination address is twice 
this address, and thus 2qM + 2 • {(M/2) to M-1}even = 2q'M + {0 to M-2}even . modM is eliminated from this to obtain 
{0 to M-2}even for the memory bank. The other address is the one address plus one. Memory banks having the other 
type of addresses interleave between memory banks having one type of addresses. As a result, the output path mem- 
ory can be determined without any conflict. 

[0047] The M/2 ACS pairs can be parallel-processed within the whole range. 

[0048] Fig. 6 is a block diagram showing an event-driven embodiment as another embodiment of the present inven- 
tion. 

[0049] According to this embodiment, each block starts operating by a start signal from a sequencer 501 , and sends 
back an end signal upon completion of the operation. Limitations imposed by operation clocks necessary for the blocks 
are removed, realizing a clock-free architecture. The operation of this embodiment will be sequentially described. 
[0050] Upon reception of 3-symbol soft decision data, the sequencer 501 sends a serial/parallel conversion start sig- 
nal to a branch metric block 502. Upon reception of the start signal, the branch metric block 502 performs serial/parallel 
conversion, interlocks with the ACS operation to transmit a branch metric, and sends back a conversion end signal to 
the sequencer 501 . 

[0051 ] When branch metric transmission becomes possible, the sequencer 501 sends a generating polynomial circuit 
signal output start & ACS operation start signal to a path metric & ACS block 503. Upon reception of the start signal, 
the path metric & ACS block 503 starts ACS operation while sending a signal corresponding to the ACS operation to a 
generating polynomial circuit. 

[0052] The decision results obtained by the ACS operation are sent to a path memory write block 504, and written by 
a write start signal from the sequencer 501 . 

[0053] This operation is repeatedly performed for all states decided from the three symbols. The sequencer 501 stops 
supply of operation clocks to each block and saves the battery until next three symbols are input. For the next three 
symbols, the same processing is repeatedly performed to complete one frame processing. 

[0054] After one-frame data is written, the path memory write block 504 sends back a 1 -frame end signal to the 
sequencer 501. Upon reception of this signal, the sequencer 501 sends a traceback start signal to a path memory 
traceback block 505. Upon reception of the start signal, the traceback block 505 starts traceback and stores the results 
in an FILO (First In Last Out) circuit. Upon completion of one-frame traceback, the traceback block sends back an end 
signal to the sequencer 501. 

[0055] The sequencer 501 sends a data output & CRC check start signal to a data output block 506. The data output 
block 506 sends data and checks CRC in synchronism with the transmission timing, and sends an end signal to the 
sequencer 501 upon completion of the operation. 

[0056] When each block finishes its operation, the sequencer 501 stops supply of operation clocks to the block to set 
the battery saving mode. The sequencer 501 receives one-frame data whose length is time-compressed, and performs 
traceback during the idle times between frames. Accordingly, the sequencer 501 can successively receive frames. 
[0057] This embodiment is of the event-driven type by start and end signals. Even if the symbol rate changes, data 
can be processed within the maximum operation speed of each block. The battery is saved for a block having com- 
pleted its operation, and the power consumption can be flexibly reduced. Further, the blocks can be independently 
tested. 

[0058] Figs. 7A and 7B are explanatory views of still another embodiment of the present invention in which the stages 
of path memories are truncated by the length of a plurality of blocks in traceback such that the truncation range overlaps 
preceding and succeeding truncation ranges in units of blocks. 

[0059] In Fig. 7A, a path memory 601-1 is first used. When decision data are stored to the last address in the path 
memory 601-1, data detected from a traceback start position set by a maximum likelihood decision circuit to a hall-trace 
back position are used as compound data. 

[0060] The path memory changes to a path memory 601-2. When half path memory data are added, data are similarly 
traced backfrom the last maximum likelihood decision position, and only the second half results are used as compound 
data. 

[0061] With the same processing for path memories 601-3 data are traced back to overlap each other, and the 
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ranges truncated in units of blocks are compounded. The path memories 601-1 , 601-2, 601-3,... construct a ring buffer 
structure in which the first and second halves are linked. Accordingly, the ACS decision results are seamlessly stored. 
[0062] The traceback start memory address position is alternately controlled between points A and B, as shown in 
Fig. 7B. In the first embodiment, since traceback is performed between frames, the traceback time is ensured by time- 
compressing data in units of frames. In the embodiment shown in Figs. 7A and 7B, since frames are divided in units of 
blocks, the traceback time is ensured by time-compressing data in units of half blocks. 

Claims 

1. A Viterbi decoder for Viterbi-decoding an input signal, comprising: 

a path memory; 

a shift register having at least (constraint length -1) bits as the number of stages; and 

a back-trace circuit for inputting, to said shift register, an AND per bit between a signal generated by a decoder 

connected to said shift register and a content of said path memory designated by a traceback address counter. 

2. A Viterbi decoder comprising: 

a traceback circuit including a path memory constituted by a first RAM; 

an ACS circuit including a path metric memory constituted by a second RAM; and 

a branch metric circuit, 

wherein said traceback circuit including the path memory has a shift register having at least (constraint length 
-1) bits as the number of stages, a signal generated by a decoder connected to the shift register and a content 
of the path memory designated by a traceback address counter are ANDed in units of bits, and the AND is 
input to the shift register. 

3. A decoder according to claim 2, wherein the path memory constituted by the first RAM is divided into a plurality of 
blocks, at least one upper bit of the shift register forms a chip select signal, the divided path memory is selected by 
the chip select signal, a decoding result of remaining bits of the shift register and a content of the path memory 
selected by the traceback counter and the chip select signal are ANDed in units of bits, and the AND is input to the 
shift register. 

4. A decoder according to claim 2 or 3, wherein the path metric memory constituted by the second RAM is divided 
into a plurality of blocks, each of the divided path metric memories comprises an ACS circuit, and sums corre- 
sponding to surviving paths selected by the ACS circuits are written in units of the divided memory groups in storing 
the sums as maximum likelihood of a state in the path metric memory. 

5. A decoder according to claim 2. 3 or 4, further comprising an ACS circuit and a traceback circuit which start oper- 
ating by an ACS start signal and a traceback start signal, said ACS circuit and said traceback circuit being event- 
driven by the respective start signals. 

6. A decoder according to claim 2, 3, 4 or 5, wherein when path memory stages in the traceback circuit are truncated 
by an appropriate length corresponding to a plurality of blocks, a truncation range overlaps preceding and succeed- 
ing truncation ranges in units of blocks, a traceback start position is set by a maximum likelihood decision circuit, 
and a start block of one surviving path in a truncation symbol for a state selected by traceback is used as decoded 
data. 

7. A decoder according to claim 6, wherein the path memory divided into the plurality of blocks is a ring buffer rotating 
in units of blocks, traceback starts from last data in a last block, and data corresponding to a start block is used as 
decoded data. 

8. A decoder according to claim 6 or 7, wherein a ring buffer formed by the path memory divided into the plurality of 
blocks is a RAM, and data update in units of blocks is realized by address control. 

9. A decoder according to claim 6, 7 or 8, wherein input data corresponding to each of the blocks is time-compressed 
in units of blocks, and ACS processing and traceback are performed at a compressed timing. 
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